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Abstract 

The Stevens classification of levels of measurement involves four types of scale: "Nominal", "Ordinal", "Interval" and "Ratio". 
This classification has been used widely in medical fields and has accomplished an important role in composition and 
interpretation of scale. With this classification, levels of measurements appear organized and validated. However, a group 
theory-like systematization beckons as an alternative because of its logical consistency and unexceptional applicability in 
the natural sciences but which may offer great advantages in clinical medicine. According to this viewpoint, the Stevens 
classification is reformulated within an abstract algebra-like scheme; 'Abelian modulo additive group' for "Ordinal scale" 
accompanied with 'zero', 'Abelian additive group' for "Interval scale", and 'field' for "Ratio scale". Furthermore, a vector-like 
display arranges a mixture of schemes describing the assessment of patient states. With this vector-like notation, 
data-mining and data-set combination is possible on a higher abstract structure level based upon a hierarchical- 
cluster form. Using simple examples, we show that operations acting on the corresponding mixed schemes of 
this display allow for a sophisticated means of classifying, updating, monitoring, and prognosis, where better data 
mining/data usage and efficacy is expected. 

Keywords: Scales on measurement, Stevens classification, Interpretation, Abstract algebra, Data-mining, 
Hierarchical cluster, Clinical medicine 



Background 

In 1946, S. S. Stevens devised his classification of "levels 
of measurement" [1], which subsequently has been used 
widely and has accomplished an important role in com- 
position and interpretation of scales in medical fields. 
The systematics of levels of measurement seems to have 
been organized and validated by virtue of this classifica- 
tion. Nevertheless, we believe that an abstract algebra- 
like interpretation/systematization awaits introduction 
because of its logical consistency and unexceptional applic- 
ability in describing patterns and processes. We conjecture 
that it offers benefits in clinical medicine, especially, with 
respect to scales of measurement [2,3] . 

Thus, in the following, we re-interpret Stevens classifica- 
tion, and endeavour to give it meaning in some abstract 
algebra-like modelling. There, the most preferred construct 
is a vector-like structure of various sets of scores based on 
individual scales and operators that permit changes of 
score within the set. Additionally, classical datasets that are 
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classified in terms of the Stevens scales of measurement 
can be mined and combined on a higher abstract structure 
level based upon a hierarchical-cluster form. To explore 
this possibility, we provide simple examples to help readers 
understand this modelling tool. 

§1. Application of group/field of abstract algebra to the 
various types of scales 

Stevens classified the scales of measurement into four 
scale types [1]; I) "Nominal scale" that uses only labels or 
numbers (e.g., numbering of football players, blood type, 
nationality); II) "Ordinal scale" that introduces equality, 
rank-ordering (e.g., hardness of minerals, grading for effi- 
cacy of clinical treatment); III) "Interval scale" that is 
based on equally quantitative intervals (e.g., temperature 
as read in centigrade, duration, frequency); and IV) "Ratio 
scale" that assumes a 'zero' as an origin, equality, rank- 
order, equality of intervals, and equality of ratios (e.g., 
absolute temperature, speed of vehicles, and most physical 
values) that then admit manipulations using the four 
arithmetic operations. 
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For I), the "Nominal scale", there seems to be little 
room where group theoretical operations apply because 
within that scale only a labelling scheme is permissible. 
Although some non-cyclic group might be definable, it 
seems that little meaning can be attached to operations 
for this sort of scale. 

For II), the "Ordinal scale", a ranking is realised by 
introducing a set with an N-graded scoring like '1, 2, 
3,..., N — 1, N' (N: positive integer) for a score defi- 
cient in (or with no absolute need for) a quantitative 
character, but not requiring a '0' score according to 
the Stevens classification. Historically, the "graphic 
rating scale", a grading from I to V, was proposed by 
Hayes and Patterson in 1921 [4], and Freyd in 1923 
[5]. However, here, we envisage either operations that 
decrease the score by '1' in an N-graded graphic scale 
necessitating a '0', so that {0, 1, 2, 3,..., N - 2, N - 1} 
establishes the scoring scale, or simply adding the 
score '0' as in {0, 1, 2, 3,..., N - 2, N - 1, N}. We focus 
on the former type. Then, for an arbitrary non- 
negative integer X, the operation giving the remainder 
of X after division by N, written X (mod N), defines 
the cyclic group Z N = {0, 1, 2, 3,..., N - 2, N - 1}, 
where modulo N addition is postulated. With this as- 
sumption, given two elements Xj' and 'X k ' (Xj, X k e Z N ) 
corresponding for example to the severity of a clinical 
symptom and/or finding, then composition (denoted 
by '*') is taken to be modulo N addition; 'Xj*X(j^ k ) = 
X k ' (with Xrj^jj) e Z N ). Here 'X(j^ k )' is an operator that 
produces the change in score, 'Xj — » X k ' (formally we 
have 'X(j_,ic) = Xj" *Xk = X k - Xj'). Then, all scores 'X/s 
and operators 'X(j_»k)' are composable within a single 
Abelian modulo additive group 'Z N ', where 'Xj*X k = 
X k *Xj' holds, at least, in terms of operation '*'. Thus a 
patient's state corresponding to a certain illness or disease 
can be changed through the application of a single oper- 
ation determined by the two elements belonging to 'Z N ' 
[6,7] representing the previous and current state of the pa- 
tient. A simple example is presented in Appendix A. 

If a state of maximum severity is present, then the 
antithesis for any given disease Y is the ideal healthy 
state E Y = [0|0|0|0|0|...], the combination of all scores 
being '0' and represented by the identity element for 
group Y = {Z N xn , *}. Here, Y is the n-fold Cartesian 
product of 'Z N ' (n: the number of components) that 
comprises all possible assessments related to each state 
of a given disease, for instance, 'hypertension! 'hypergly- 
caemia! 'diabetes mellitus', 'acute pancreatitis', 'systemic 
lupus erythematosus', and 'cerebral artery stroke'. If in 
addition composition is given by modulo 'N' arithmetic, 
prime numbers (e.g., N = 7) are preferable [8,9] and 
considerable parts of components could be overlapping 
among individual diseases as was mentioned in our 
previous reports [6,7]. Note that, in practice, equal 



increments within a grading scheme are not always 
postulated. Nevertheless, the scale represented by this 
Abelian modulo additive group 'Z N * n ' will be called a 
"modular scale". However, it may be an atypical case 
(partially weakened example) of a "Ratio scale" (type IV) 
without the strict requirement for equal calibration. In- 
deed, there are such scales because, like the 'TNM clas- 
sification (with a 'TO' entry) for malignant tumours' 
[7,10], grades for scoring are determined for example 
according to histological characteristics, selection of 
treatment, and prognosis, having no strict linearity in 
scale, but which might be regarded as an "modular 
scale". Based upon these results, for instance, the fol- 
lowing are considered composable; Abelian modulo 
additive group Y 1 = {Z 7 , *} for 'hypertension', Y 2 = {Z 7 , *} 
for 'hyperglycaemia', Y 3 = {Z 7 , *} for 'diabetes mellitus', 
Y 4 for 'acute pancreatitis', Y 5 for 'systemic lupus erythe- 
matosus', Y 6 for 'cerebral artery stroke', Y a n = {Z 7 x Z 7 x 
Z 7 x *} = {Z 7 * n , *} (n: the number of components) for 
an entire body, and Y 7 = {Z 8 x Z 4 x Z 2 x Z 2 , *} for the 
'TNM classification (with a 'TO' entry) for malignant tu- 
mours' [7,10]. Additionally, these are treatable without ex- 
ception within the abstract algebraic theory. For this case, 
an equal calibration for severity may have unbeneficial 
outcomes if used in clinical treatments. However, for 'de- 
lirium! 'chronic liver dysfunction! 'acute pancreatitis! and 
'diabetes mellitus! for example, total scores based on equal 
calibration are desirable to assess disease severity. 

For III), the "Interval scale", differences in quantities 
are allowed. An example is 'periods of time' or 'duration', 
which, although can be measured with ratio scales, en- 
ables one period to be double another when compared. 
The same is true of 'temperature'. If parameters 'Xj' and 
'Xi' e R (the continuous real number line) have ranges 

-co < X < +cx, (i) 

we can consider an operator 'X k ' that causes changes 
from 'Xj' to 'Xi! and introduce a binary operation, de- 
noted '•>', where ordinal addition and its inverse, subtrac- 
tion, are assumed; 

Xj °X k = Xj + X k = Xi (j , k, 1; session numbers) (ii) 

In this regard, as for 'Xj! it can also be expressed as a 
sum of an integer part and a decimal part, 

Xj = lirij + Cj (iii) 

(irij = [Xj], Cj = Xj - [Xj], '0 < Cj < 1'; '[X]' is the floor func- 
tion meaning the highest integer below X'). Similarly, 

X k = lm k + c k (m k = [X k ] , c k = X k - [X k ] , '0 < c k < 1') (iv) 
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X! = lmi + q (mi = [Xi], q = Xi-[Xi],'0 < q < 1') (v) 

'1' is a 'unit length' of the respective values. Thus, (iii) - 
(v) can be redefined using the unit length T' as an interval 
scale, 

X r X k = (lmj + Cj ) + (lm k + c k ) 

= l(mj + m k ) + (cj + Ck) = lmj + c\ 

= Xj ' (vi) 

There exists an identity element 'X 0 ' (=0) that satisfies 
'Xj °X 0 = X 0 °Xj (=Xi + X 0 = Xj + 0) = X/. Additionally, the 
inverse element is 'Xi" = -Xj' satisfying 'Xj~ lo Xj = Xj "X^ 1 = 
Xj + Xj-^Xj-X^Xo^O)'. 

Naturally, commutativity and associativity are satisfied. 
Let U be the set that comprises all 'Xj's, i.e., U = {Xj | Xj 
e R}. Because 'Xj , X k , Xj e set U, the closure law holds. 
Therefore, this operation defines a group U = {Xj, °} 
[2,3]. "Body temperature readings", "clock time for the 
onset of sleep within a day" and "clock time for the onset 
of drip infusion within a day" are definable in this scale. 
Examples of the first two are provided in Appendix B. By 
making use of this procedure, the differences between 
quantitative values and operators are eliminated, and 
both can be regarded as elements belonging to a single 
group U. Moreover, a collection of additive Abelian 
groups Ui = {Xij | Xij € R (deg C)} based on an individ- 
ual's clinical values can be described as, as for example 
Ui = {Xij, °} for "body temperature readings", and U2 = 
{X 2j I X 2j e R (/24 hrs)} and U 2j = {X 2j , -} for "clock time 
for the onset of sleep within a day", U 3 = {X 3j | X 3j e R 
(/24 hrs)} and U 3 j = {X 3j , °} for "clock time for the on- 
set of drip infusion within a day",..., Un = {X Nj , •>},..., 
(N: natural number). Those are considered readily 
treatable and recordable within an abstract algebraic 
context. 

For IV), the "Ratio scale", the 'administration of medi- 
cine (with strict dosage regimes)' and 'International Stat- 
istical Classification and Health Related Problems' [11] 
were given as examples in our previous report [6,7]. Es- 
sentially, for this scale, because the four arithmetic oper- 
ations are possible, 'rings' and 'fields' in abstract algebra 
are applicable so long as composition is given by modulo 
'N' arithmetic with 'N' a prime. Although there could be 
scope where the four modulo arithmetic operations 
(denoted by 't' in 'XjtX k = X{) are applicable in assess- 
ment scoring in clinical medicine, it might be preferable 
at this stage to confine the application of ratio scales to 
just modulo N addition '*' collectively for 't', similar in 
manner as established in Appendix A. For the example 
given in Appendix A, the difference in interpretation is 
the presence/absence of an equal calibration. 



Whereas the scale of 'TNM classification for malig- 
nant tumours' [10] was regarded as an example of an 
"Ordinal scale", some of the scales defined as "Ratio 
scales" at initial glance should be regarded as "Ordinal 
scales" accompanied with '0'. It might be contentious 
whether clinical assessments performed using superficial 
scales based on the four arithmetic operations could 
have sufficient validity in clinical treatments or clinical 
research. 

Nevertheless, other clinical scales range over a semi- 
open continuous interval like '0 < X < +°°' (X: real num- 
ber), such as 'blood concentration of white blood cells: 
[WBC] (/mm )', and 'administration of a certain drug 
like lithium carbonate: [Li + ] (mEq/1), sodium: [Na + ] 
(mEq/1), calcium: [Ca ++ ] (mg/dl), chloride: [CP] (mEq/1) 
and bicarbonate: [HC0 3 ] (mEq/1)'. Also, there are clin- 
ical scales whose ranges are the open interval like '-°° < 
X < +00' (X: real number); Anion gap [AG] = [Na + ] - ([CT] + 
[HC0 3 ]) (reference range for blood tests: 12 ± 2 mEq/1)' 
and 'Base excess [BE] (reference range for blood tests: 

0 ± 2 mmol/1)'. However, both can be treated using the 
notion of 'field' because those values are real numbers 
where all four arithmetic operations are included, with 
the exception of division by zero. Thus, the above clin- 
ical values could be definable over a 'field'. In this re- 
gard, we assume a rule that each unit like 'mEq/1' 
accompanies the value automatically with the results of 
operations regardless of types of operation among the 
four arithmetic operations (Note that there are cases 
when units vanish as when ratios are taken 'mEq/mEq 
(unitless)' or displayed in reciprocal form like '1/mEq'). 
Examples for '[WBC] (/mm 3 ); '[Na + ] (mEq/1)' are pre- 
sented in Appendix C. 

In this case, we consider a set V and assume that '#' 
means one of 'addition, subtraction, multiplication, and 
division' collectively; thus, 'Xj # X k = X t (eV), where or- 
dinal arithmetic calculations are performed excluding of 
course division by zero. 

For set V, addition is commutative: Xj + X k = X k + Xj, 
and associative: (Xj + X k ) + Xi = Xj + (X k + Xi). As for 
multiplication, set V meets the conditions of a 'monoid' 
[2,3]. Associativity: (Xj x X k ) x X[ = Xj x (X k x X t ), with 
Left and Right Distributivity: Xj x (X k + Xj) = Xj x X k + 
Xj x Xi, (Xj + X k ) x Xi = Xj x Xi + X k x Xj. A nonzero Iden- 
tity X 0 (=1) for multiplication exists. The Inverse 'Xj" = 
1/Xj' satisfies 'Xj x X. rl = Xj" 1 x Xj = X 0 (=1)'. For division, 
'Xj/X k = Xj x X k =1' is definable except for division by 
zero. Therefore, we can confirm that set V is a 'field'. It 
can be expressed as V = {Xj, #} or V = {Xj | Xj g R}. 

Furthermore, different fields based on different sets of 
clinical values can be described as follows: field Y 1 = {X^ 

1 Xij e R (/mm 3 )} and Vx = {Xxj, #} for "blood concentra- 
tion of white blood cells: [WBC] (/mm 3 )", field V 2 = {X 2j 
I X 2j e R (mEq/1)} and V 2 = {X 2j , #} for "administration 
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of a certain drug like lithium carbonate: [Li + ] (mEq/1)", 
field V 3 = {X 3j | X 3j e R (mEq/1)} and V 3 = {X 3j , #} for "so- 
dium: [Na + ] (mEq/1)", field V 4 = {Xtj, #} for calcium: [Ca ++ ] 
(mg/dl), field V 5 for chloride: [CT] (mEq/1), field V 6 for 
'Anion gap [AG] (mEq/1)', field V 7 for 'Base excess [BE] 
(mmol/1)',..., V N ,..., (N: natural number). For each, an 
independent abstract algebraic treatment is possible as for 
ordinal abstract algebra. 

§2. A vector-like notation using group/field operations 
belonging to a single set 

By making use of all types of scales of measurement, 
we propose a vector-like expression of a patient's state 
(denoted 'Rj! j = 1, 2, 3,...: number of sessions), where 
the mixed expression and its totality of operations that 
could be performed belong to a single set R. Because of 
the possible variety of operation rules, the genuine use 
of this set may be unwieldy at this stage. 

Partially based upon our previous description [6,7], let 
us define 'Rj' to be a vector of five clinical values, 

R; = [severity for depression (within modulo 7 arith- 
metic) | clock time for the onset of sleep (/24 hrs) | 
blood concentration of white blood cell [WBC] (/mm 3 ) | 
blood concentration of [Na + ] (mEq/1) | a certain value (a 
certain operational unit)], 

= [X 0)1 (mod 7) | X 0)2 (/24 hrs)| X 0)3 (/mm 3 ) (vii) 
|X (j)4 (mEq/l)|X (j)5 (...)] 

Next, suppose the patient's state 'Rj' changes to 'Rj+i' 
effected by operator 'R ( j^j +1) '; we denote by '0' the binary 
composition composed of the product of compositions 
for each component. Three possible states are: 

Ri = [(X(i)i =) 2 (mod 7) | (X (1)2 =) 21 (/24 hrs) | (X (1)3 =) 5000 
(/mm 3 ) | (X (1)4 =) 145 (mEq/l)|X (1)5 (...)] 

R 2 = [( x (2)i =) 5 (mod 7) | (X (2)2 =) 19.5 (/24hrs)| (X (2)3 =) 18000 
(/mm 3 )| (X (2)4 =) 128 (mEq/l)|X (2)5 (...)] , 

R 3 = [(X (3) i =) 3 (mod 7) | (X (3) 2 =) 22 (/24 hrs)| (X (3)3 =) 7000 
(/mm 3 ) | (X (3) 4 =) 158 (mEq/l)|X (3)5 (...)]. 

For the 1st component, 'X( 1 ) 1 *X( 2 )i', and 'X (3)1 ', modulo 
7 arithmetic (addition) is used. For the 2nd compo- 
nents, 'X( 1 )2, X( 2 )2, X( 3 ) 2 ', operations of Abelian addition 
are used. For the 3rd component, 'X( 1)3 , X (2 ) 3 , X( 3)3 ', 4th 
'X(!)4, X( 2 )4, X( 3 ) 4 ', the four arithmetic operators (those 
operations denoted by '#') are required, and for the 5 th , 
'X( 1)5 , X( 2 )5, X( 3 ) 5 I a certain operational unit is postulated. 
In the following examples, only addition/sub traction is 
presented; naturally, multiplication/division is also consid- 
ered permissible. 



With related operators R(i^ 2 ) = [X(i^ 2 )i(mod7) | X ( i_ 2)2 
(/24 hrs) | X (1 _ 2)3 (/mm 3 )| X (1 ^ 2)4 (mEq/l)|X (1 ^ 2)5 (...)], 
and R (2 ^ 3) = [X (2 ^ 3)1 (mod7) | X (2 ^ 3)2 (/24 hrs)| X (2 ^ 3)3 
(/mm 3 ) | X (2 ^ 3)4 (mEq/l) | X (2 ^ 3)5 (. . .)] 

Then, using results in Appendix D, 'R(i^ 2 )' and 'R( 2 _> 3 )' 
from the three states given above are as follows: 

R (1 ^ 2) = [3 (mod 7) |-1.5(/24hrs)|13000(/mm 3 )| 
-17(mEq/l)|X (1 ^ 2)5 (...)] 

R (2 ^3) = [5(mod7) |2.5(/24 hrs)|-11000(/mm 3 ) | 
30(mEq/l)|X (2 _ 3)5 (...)] 

Thus, we confirm the relation 

Ri0R(i-2)0R( 2 -,3) = Rs (viii) 

Details are illustrated in Appendix E. 

Note that, in general, there exists an identity 'E (=Ro) = [0 
(mod 7)| 0 (/24 hrs)| 0 (/mm 3 )| 0 (mEq/1) | X 0 (...)]' such 
that 'RjOE = EORj = Rj'. Additionally, there exists an inverse 
for any 'R- 'R^ 1 = [X(j)l(mod7) | X 0)2 (/24 hrs)| Xg )3 
(/mm 3 )| Xg^mEq/l)!^...)] = [7-X 0)1 (mod 7) | 24-X (j)2 
(/24 hr^l-X^/rnm^l-Xy^mEq/lJlX^...)]' that 
satisfies 'R^ORj = RjOR^ 1 = E'. However, commutativity, 
'RjORk = R k 0Rj' and associativity, '(RjOR k )ORi = RjO(R k ORi)' 
are not satisfied. Here, we assume that operators acting on 
'Rj's should be performed from left to right, that is, from 
Ri to R m (m; number of session for assessment). They 
should not be applied between 'Rj's. For any assortment of 
'Rj's with scales of measurement among types I)-IV), a 
single set R = {Rj| X (j) i x X (i)2 x X (j)3 x X 0)4 x X ( j )5 } (V 
means products among groups and fields) using a 
vector-like notation for the scoring of patient states can 
be structured where all possible assessments and/or 
clinical findings of the patient and treatment are in- 
cluded. The general form is the n-fold product; set R = {Rj | 
X(j)i x X 0)2 x X(j )3 x X(j )4 x ...xX(j)( n _ 2 ) x X(j)( n _!) x X(j )n } (n; the 
number of components). 

As for the possible application to better data mining 
or data usage from the viewpoint of our reinterpretation, 
we provide a simple example that may help readers to 
follow an outline of the argument. Consider an example 
of 17 states "Ri, R 2 , R17" (eset R) each with four com- 
ponent ('n = 5') and arrows (only symbols) that indicate 
the possible changes among the 'Rj's, as displayed in 
Figure 1. The scheme covers the notation of our model, 
and also that of existing methods where (possible) re- 
sults of data, 'Rj's, are not combined directly with each 
other in the sense of operations. Then, the arrows could 
be re-displayed according to our concepts as operators 
'R(j_,k) that can be regarded as elements 'Rj' belonging 
to a set R as in Figure 2. In ordinal data sets, the 'Rj's are 
merely a collection of values and the arrows in Figure 1 
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R 7 - 



Rj ► R 2 ► R 3 ► R 4 ► R 5 



R 10 ►Rjj ►Rj, 



R 13 «-R 14 -R 15 -R l6 



Figure 1 Example of a tree composed of a data set and ordinal 
arrows. The tree represents changes of states between 1 7 'Rj's data 
elements. In ordinal existing methods, data are only a collection of 
results that are not directly combined; manipulations of parts of the 
data are defined separately. The arrows merely indicate a change from 
one state 'Rj' to another TV and have no specific operational sense. 



0 R^^gO Rg_>9 (— R9) 



R 1 0Rw 2 0R2^30R 3 ^ R 4^^ R 5^6 (= R 6 ) 



0 

R-?_,m aR 



0 
R 



12^4 



2-10 o K io-n 0 R-n-12 (= R, 2 ) 

0 

R 10 ^13 0 R 13^14 0 R 14^15 0 R 15^16 (= R l&) 

0 

R 13^17 

(= R n) 

Figure 3 "Operational tree" — the compositional scheme using 
symbol '0'. The tree of Figure 2 is re-illustrated using composition 
symbol '(>', where the operators are assumed to belong to the single 
set R. By admitting algebraic correspondences, this compositional 
scheme could potentially provide better data mining/data usage. 



are only marks. However, in our interpretation, all 'Rj's 
and 'R(j^ k) 's are elements of a single set R subject to ax- 
ioms of an abstract algebra as indicated using composition 
symbol '0' in Figure 3. There, the changes 'from Rj to 
R k ' can be traced at each session. Displayed in this way, 
Figure 3 represents an "operational tree" that could offer 
potential for better data mining/data usage through a 
more generalized/concise treatment (e.g., withdrawing/re- 
cording correspond to schemes in Figure 3) that might be 
permissible. Practical improvements for efficacy, however, 
will need future investigations. 



R i-p — " R 2 




R l-2 2 R 2^3 



^2^10 
Rin_ 



n R 7^8 . p R 8^9 . p 
K y ► Kg ► K, 



R 3 ► R 4 



R 13^14 p R 14^15 D ^15^16 r> 
K 13 *' K 14 *' K 15 "• K 16 



R,7 

Figure 2 Scheme of the tree with arrows labelled by operators. 

Arrows are interpreted as operators 'R(j^k)' s that could be regarded 
as ordinal elements 'Rj's belonging to a single set R. Each operator 
that changes 'Rj to R k ' can be traced and its degree for each session 
identified from initial and final states. The final states (R 6 , R g , R 16 and 
R, ? ) can be traced back to any initiating state 'R q ' by performing an 
appropriate sequence of 'Rg^'s. 



Here, consider the scenario of Figure 1 where from an 
initial value 'Ri' there are four outcomes 'R 6 ^ 'R 9 ', 'Ri 6 ', 
and 'R17' containing nodes at 'R2' 'R4' 'Rio' 'R12' and 'R13'. 
By making use of our previous examples 'Ri - R 3 ) the 
next simplest examples with 'n (component number) = 5' 
can be confirmed easily: 

Rio = [0 (mod 7) 1 17 (/24 hrs)|9000 (/mm 3 ) 
|130 (mEq/l)|X (10 ) 5 (...)] 

Rn = [6 (mod 7) |20 (/24 hrs)[20000 (/mm 3 ) 
|149 (mEq/l)|X (11)5 (...)] 

R12 = [4 (mod 7) 1 23 (/24 hrs)[6000 (/mm 3 ) 
|140 (mEq/l)|X (12)5 (...)] 

R13 = [1 (mod 7) 1 18 (/24 hrs)|5000 (/mm 3 ) 
|135 (mEq/l)|X (13 ) 5 (...)] 

Ri 7 = [2 (mod 7) [23.5 (/24 hrs)|3000 (/mm 3 ) 
[150 (mEq/l)[X (17)5 (...)] 

Following these results, the next relations, according 
to the tree in Figure 3, can be obtained for instance: 

RlvR(i_ >2 )v^(2->10)vR(lo-»ll)vR(ii_ > i2) = R 12 
RiOR (1 ^ 2 )QR(2^10)QR(10^13)QR(13^17) = Rl7 

The operator expressions are evaluated in Appendix F. 
Similarly, the next sequences are definable in principle, 

R 1 0R (1 _ 2) 0R (2 _3)0R(3^4)0R(4^5)0R(5^6) = R 6 

Ri0R (1 ^ 7) 0R (7 ^ 8 )0R(8^9) = R 9 
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Rl20R(i2— 4) = R4 
RlOR( 1 ^2)^R(2^10)^R(10^13)^R(13^14)^R(14^15)^R(15^16) = Rl6 

In general, we denote a node divergence 'R a to Rb 
(=R a 0R (a _^ b) = R b )' and 'R a to R c (=R a 0R (a ^ c) = R c )' as 'R a 
[(0R( a _ >b) )(0R( a ^ c ))]' (a,b,c: non-negative integers); here '( ) 
( )( )...' meaning simple juxtaposition. All paths belonging 
to the operational tree of Figure 3 can then be described/ 
recorded, for instance, as the sequence 

RlOR (1 ^ 2 ) [(0R (2 ^3)0R(3^4)0R(4^ 5 )0R(5^6)) (0R (2 ^ 7 ) <>R(7^8) 

0R (8 ^ 9) )(0R ( 2^io)[(0R(io^ii)0R(n^i2)0R (1 2^4)) 
(OR (10 ^ 13) [(0R (1 3^ 14) 0R (14 ^ 15) 0R (15 ^ 16) ) (0R (13 ^i7))] )] )] 

(ix) 

To display for easy recognition, for example, end states 
like 'R 6 , R9, Ri 6 , and R17' and divergence point 'R 4 ' a no- 
tation '(=Re), (=R9), (=Ri6) and (=Ri7)' might be consid- 
ered. Hence, 

RlOR(l^ 2 ) [(0R ( 2^3)0R(3^4)0R(4^5)0R( 5 ^6)(= R6))(0R( 2 ^7) 

OR (7 ^ 8) OR (8 ^ 9) (= R9))(0R (2 ^io) [(OR ( io^n)OR ( 11-. 12) 

0R (12 ^4) (= R 4 ) ) (OR(10^13) [(<>R(13^14) 0R (14 ^15) 

OR (15 ^i 6) (= Ri 6 ))(0R (13 ^i7)(= R17))])])] 

M 

Moreover, composition with an operator as in operating 

On 'R (3 _ 4) [(0R( 4 -h. 5) 0 R(5 _ 6))(0R(4 - 8) 0 R(8 _ 9) 0 R(9 - 10)) 

(0R( 4 _ > i5) 0 R(i 5 _i6)) ...] from the left-hand side by 
'R 3 '. The subsequent result can be expressed in accord- 
ance with the single scheme presented in Figure 3, 



R30R(3^4)[(OR(4^ 5 )OR(5^6))(OR(4^8)OR(8^9)OR(9^10)) 
(0R(4^1 5) vR 

(15->16) )•••] — R6, RlO, Rl6, ••• 

(xi) 

Note that the above descriptions (ix)-(xi) express one- 
to-many functionality. However, we think that these 
formulae are the algebra equivalent to the single oper- 
ational tree as exemplified by Figure 3. These play the 
algebraic role in composite record-keeping in applied 
fields such as medicine. In this formalism, any possible 
result 'Rj' (€set R) is obtained and traceable from any 
state 'Rk' under operations involving a plurality of ele- 
ments belonging to a single set R. 

Additionally, we can include data mining in a more 
symbolic/abstract way as follows. For an arbitrary j (j = 
1, 2, 3,..., m), a hierarchical-cluster-like expression can 
be defined [12]. For instance, if a partition of Rj is a set 
of subsets H = dRj, 2 Rj, sRj,---, r Rj) such that (1) Rj e H; 
(2) for all single sets s Rj in Rj, s Rj e H; and (3) ' s Rj n t Rj € 
{([), s Rj, t Rj}' for all s * t = 1, 2,..., r. That is, condition (3) 
means that either any two clusters ' s Rj and t Rj' are dis- 
joint, or one cluster is contained entirely inside the 
other, and every individual Rj is contained in at least one 
cluster larger than itself. Note that if ' s Rj n t Rj = 4>' for all 
s * t, then the hierarchy becomes a partitioning. Hence- 
forth, reference to a hierarchy implies that ' s Rj n t Rj = §' 
for at least one set of (s, t) values. In the previous ex- 
ample (vii), 'Rj' could be expressed in hierarchical-cluster 
notation where there are eight clusters (and relabeling 
within clusters) as shown in Figure 4. If Rj comprises 
'iRj 1 and 2 Rj X ; the first level of hierarchy, 'Rj = iRj 1 u 2 R)' 
holds. At the second level, 'iRj 1 = nR, 2 u 12 R 2 ' = [Xqji 
(mod 7) I X (j)2 (/24 hrs)], ' 2 r/ = 21 Rf u 22 Rf = [X (j)3 
(/mm 3 ) I X(j) 4 (mEq/l)|X(j) 5 (...)], whereas at the third 
level, ' 22 Rf = 221 Rf u 222 Rf = [X (j)4 (mEq/l)|X (j)5 (...)], 221 Rf 
= [X (i)4 (mEq/1)], 222 Rf = [X (j)5 (...)] (Figure 4). Hence we 



Rj = [X 0)1 (mod 7)1X^,2 (/24hrs)IX a)3 (/mm^lX^ (mEq/DIX^ (...)] 




,R/ = fXgjj (mod 7)IX 0)2 (/24hrs)], 2 Rj' = rx ffl3 (/mm^lX^ (mEq/l)IX 0)5 (...)] 





n Rj 2 = Pernod?)], ijR J J =pC 0)J (/24hr s )], 21 Rj 2 = [X m (/mm 3 )], n Rf= fX^ (mEq/niX^ (...)] 




2 2 iRj 3 = [X ovt (mEq/1)], 222 Rj 3 = [X m5 (...)] 

Figure 4 Systematization of "hierarchical clusters". A hierarchical cluster is defined as the necessary class of subsets needed to decompose 
the set to single units. The top level is the entire dataset and that is always decomposable into base units. 
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obtain the complete set Rj = {X iin , X (j)2 , X (j)3 , X {j ) 4 , X (j)5 } = 
[X 0)1 (mod 7) | X m (/24 hrs) | X (j)3 (/mm 3 ) | X (j)4 (mEq/1) 
| X(j) 5 (...)]• A hierarchy has additional levels as necessary 
to reach single units at its base [12]. The top level is the 
entire dataset 'Rj' and that is always composable using 
base units. That is, arbitrary 'R k ' and 'Ri' can be com- 
bined into a single dataset as with 'R^ = [X(i < )i| X(k) 2 | 
...| X (k)a ]' and 'Ri = [X (1)1 | X (1)2 |...| X (1)b ]', '{R k , =Rj 



[X, 



Oil 



x (i)2 



X 



(j)a 



X 



(j)a+l| X (j)a+2 



X (j)a+b] 



(a,b; 



positive integers). In this way, classical datasets that are 
classified in the Stevens scales of measurement could be 
mined and combined on a higher abstract structure level. 
To help better understand the concept, a sequence of 
schemes illustrating the principles of our model is pre- 
sented in Figure 5. 

Subject to future improvements, we envisage that this 
compact description is versatile to provide better data 
mining/ data usage than from existing methods, although 
a final version is far from complete at this early stage. 



§3. Supplementary suggestions and limitations 

If the four arithmetic operations are appropriate in hand- 
ling the values from clinical assessments, representation 
by "Ratio scales" (in some cases, the "modular scale" with 
suitable modulo number previously mentioned) might be 
effective in describing the clinical treatments or studies. 
The "Numerical rating scale (NRS)" with range '0-10' 
[13,14] illustrates the point where the modulo 11 additive 



group 'Zn' arises as a natural modular scale. In contrast, 
similar approaches might be difficult for a "visual analogue 
scale" [15,16] where values could take any real number. 

Whereas rating scales systemized as abstract algebra- 
like form may enable a more generalized/sophisticated 
understanding, establishing a link between fields of clin- 
ical medicine and abstract algebra, and mixed states and 
operators in vector-like notation as in (vii)-(xi), does not 
always assure more concise manipulations. A mixed 
treatment as exemplified in (vii)-(xi) might not always 
yield optimal results at present. In general, combining 
group and field-like structures within 'Rj' may cause some 
confusion in handling the 'Rj's although benefits accrue 
through operational compliance and convenience in deal- 
ing with the abstract algebra. For description and records, 
a vector-like definition 'Rj' may not always be advanta- 
geous in which only the four types I)-IV) are used (par- 
ticularly for '1)', the "nominal scale", where systematization 
of operation seems to be impossible). Nevertheless, we 
infer that in the handling of operations in mixed-notation 
like 'Rj! the classification and synthesis of scales of meas- 
urement in some group/field-like form may be devised in 
a more rigorous methodology in future improvements. 

That apart, similar, redundant, and obscure compo- 
nents may have been incorporated into the 'Rj's descrip- 
tion without discretion. The 'Rj' in such instances loses 
validity and versatility in terms of a concise composition 
of scales. This is considered to result from the fact that a 
total state of a certain disease or a condition of a patient 



Ratio scale 



Interval scale 



Ordinal scale 



Nominal scale 



"Stevens classification" 



, 0 R,- 




"Operational tree" 



Abelian additive 
group 




"Abstract algebra-like systematization" 




"Hierarchical clusters" 



Figure 5 Schemes for data mining and combination in some higher abstract structure level. First, the classical dataset, classified by the 
four types of scales of measurement (Stevens classification), is re-interpreted as a group/field-like operational structure. Second, a vector 'Rj' is 
defined that is composed as the product of each type of operation for all datasets other than for those classified as "Nominal scale". Third, the 
'Rj's are constituted as an "operational tree". Fourth, 'Rj' with any arbitrary j (j = 1, 2,. . ., m) could be mined and combined as "hierarchical clusters". 
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is not always composable or describable via the combin- 
ation of partial components. This implies that a larger 
number of components is not always desirable for as- 
sessment or rating scales. 

Unfortunately, almost all current assessment scales in 
medicine are handled as if they were ratio scales although 
almost all are just ordinal scales. That might introduce 
considerable futility and/or waste of scientific resources. 
As previously indicated, some clinical scales (e.g., TNM 
classification) should be represented as an ordinal scale 
accompanied by '0' with no absolute need for a quantita- 
tive calibration (modular scale). Although a combination 
composed of entirely ratio scales seems to be difficult or 
impossible, we believe at least that appropriate operational 
structures (e.g., group, field) should always be selected that 
satisfied the conditions in instances like composition of 
scale, analysis, and interpretation of the results. These 
structures must be recognized clearly by users per each 
assessment to avoid misestimation, overconfidence, and 
complacency in scales. 

Conclusions 

The Stevens classification of scales of measurement can 
be re-interpreted and modelled as some abstract algebra- 
like systematization. Moreover, a vector-like notation using 
mixed types of operations and a hierarchical structure-like 
systematization are possible enabling a sophisticated means 
to classify, update, monitor, and forecast patient treat- 
ments. Better data mining/data usage and efficacy is ex- 
pected and will be considered in future studies. 

Appendix 

Appendix A 

Using 'N = 5' for the scale of a certain symptom or clin- 
ical finding with set Z 5 ={0, 1, 2, 3, 4}, we suppose 'Xi = T 
(e set Z 5 ) for the initial state and X 2 = 4' (e set Z 5 ) for the 
final state. Expressed as 'Xi*X(i^ 2 ) = X 2 ', the change 
can be determined as 'X(i^ 2 ) = X 2 - Xi (mod 5) = 4 - 
1 (mod 5) = 3 (mod 5) (e Z 5 )'. 

Appendix B 

Suppose 'the body-temperature thermometer' (deg C; de- 
gree Celsius) changes from 'Ti = 36.7 (deg C)' to 'T 2 = 35.1 
(deg C)'. Because 'Ti °T(i_» 2 ) = T 2 J an operator part is cal- 
culated as 'T (1 ^ 2 ) = T 3 - Ti = 35.1 - 36.7 (deg C) = - 1.6 
(deg C)'. For an another example, when there are two 
clock times for the onset of sleep 'ti = 21 (/24 hrs)' and 
't 2 = 19.5 (/24 hrs)! the operator part is determined as 't(i_> 2 ) 
(/24 hrs) = t 2 - ti (/24 hrs) = 19.5 - 21 (/24 hrs) = -1.5 
(/24 hrs) = 24 -1.5 (/24 hrs) = 22.5 (/24 hrs)'. 

Appendix C 

Provided [WBC] changes in the following manner: '5000 
(/mm 3 ) (= Wi) -^18000 (/mm 3 ) (=W 2 ). Because 'Wj # W 



d^ 2 ) = W 2 ', the operator denoted by 'W(i^ 2 )' for addition 
is derived from 'W (1 ^ 2) = W 2 - W x = 18000 - 5000 = 
13000 (/mm 3 )'. Collectively, the operator is determined by 
division: 'W ( i^ 2) = W 2 /Wj =18000/5000 (= 3.6) (/mm 3 ); 

For an another example, if '[Na]i = 145 (mEq/1)' changes 
into '[Na] 2 = 128 (mEq/1); because '[Na]i # [Na] ( i_> 2 ) = 
[Na] 2 ; the operator for addition is obtain from '[Na](i_» 2 ) = 
[Na] 2 - [Na]i = 128 - 145 = - 17 (mEq/1)'. Collectively, 
the operator for division is '[Na](i^ 2) = [Na] 2 /[Na] x = 128/ 
145 (mEq/1)'. 

Appendix D 

R ( i^2) = R 2 - Ri 

= [5 (mod 7) | 19.5 (/24 hrs) | 18000 (/mm 3 ) | 128 
(mEq/1) | X (2)5 (...)] - [2 (mod 7) | 21 (/24 hrs) | 5000 
(/mm 3 ) | 145 (mEq/1) | X (1)5 (...)], 

= [5 - 2 (mod 7) | 19.5 - 21 (/24 hrs) | 18000 - 5000 
(/mm 3 ) | 128 - 145 (mEq/1) | X (1 ^ 2)5 (...)], 

= [3 (mod 7) | - 1.5 (/24 hrs) | 13000 (/mm 3 ) | - 17 
(mEq/1) | X (1 ^ 2)5 (...)]. 

R(2^3) = R3 _ R2 

= [3 (mod 7) I 22 (/24 hrs) | 7000 (/mm 3 ) | 158 (mEq/1)] | 
X( 3 )5 (•••)] - P (mod 7) I 19.5 (/24 hrs) | 18000 (/mm 3 ) | 
128 (mEq/1) I X (2)5 (...)], 

= [3 - 5 (mod 7) I 22 - 19.5 (/24 hrs) | 7000 - 18000 
(/mm 3 ) I 158 - 128 (mEq/1) | X (2 ^ 3)5 (•••)], 

= [- 2 (mod 7) I 2.5 (/24 hrs) | - 11000 (/mm 3 ) | 30 
(mEq/1) I X (2 ^ 3)5 (...)], 

= [5 (mod 7) I 2.5 (/24 hrs) | - 11000 (/mm 3 ) | 30 
(mEq/1) I X (2 ^ 3)5 (•••)]• 

Appendix E 

RiOR (1 ^ 2) OR (2 ^ 3) = [2 (mod 7) | 21 (/24 hrs) | 5000 
(/mm 3 ) I 145 (mEq/1) | X (1)5 (...)]0[3 (mod 7) | - 1.5 
(/24 hrs) I 13000 (/mm 3 ) | - 17 (mEq/1) | X (1 ^ 2)5 (...)]0 
[5 (mod 7) I 2.5 (/24 hrs) | - 11000 (/mm 3 ) | 30 (mEq/1) 

I X( 2 _>3) 5 (•••)]> 

= [2 + 3 + 5 (mod 7) | 21 - 1.5 + 2.5 (/24 hrs) | 5000 + 
13000 - 11000 (/mm 3 ) | 145 - 17 +30 (mEq/1) | Xp )5 (...)], 

= [10 (mod 7) I 22 (/24 hrs) | 7000 (/mm 3 ) | 158 
(mEq/1) I X (3)5 (...)], 

= [3 (mod 7) I 22 (/24 hrs) | 7000 (/mm 3 ) | 158 (mEq/1) 
I X (3)5 (...)]. 

Appendix F 

For the 3rd and 4th components, only addition/subtraction 
is demonstrated collectively for ease in comprehension. 

R (2 ^io) = Rio - R 2 = [0 - 5 (mod 7) | 17 - 19.5 (/24 hrs) 
I 9000 - 18000 (/mm 3 ) | 130 - 128 (mEq/1) | X (2 ^ 10)5 
(...)] = [- 5 (mod 7) I - 2.5 (/24 hrs) | - 9000 (/mm 3 ) | 2 
(mEq/1) I X (2 ^ 10)5 (...)], 

R(io^ii) = Rn - Rio = [6 - 0 (mod 7) | 20 - 17 (/24 hrs) 
I 20000 - 9000 (/mm 3 ) | 149 - 130 (mEq/1) | X (10 ^ ll)5 
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(...)] = [6 (mod 7) | 3 (/24 hrs) | 11000 (/mm 3 ) | 19 (mEq/1) 

I X(i 0 _>ll)5 (••■)]. 

R(ii^i2) = R12 - Ru = [4 - 6 (mod 7) | 23 - 20 (/24 hrs) 
| 6000 - 20000 (/mm 3 ) | 140 - 149 (mEq/1) | X (11 ^ 12)5 
(...)] = [- 2 (mod 7) | 3 (/24 hrs) | - 14000 (/mm 3 ) | - 9 
(mEq/1) | X (11 _ 12)5 (...)], 

R (10 ^i3) = Ris - Rio = [1 - 0 (mod 7) | 18 - 17 (/24 hrs) 
| 5000 - 9000 (/mm 3 ) | 135 - 130 (mEq/1) | X (10 ^i 3 )5 
(...)] = [1 (mod 7) | 1 (/24 hrs) | - 4000 (/mm 3 ) | 5 (mEq/1) 

| X(io_>13)5 (...)]> 

R ( i3^i7) = R17 - Ri 3 = [2 - 1 (mod 7) | 23.5 - 18 (/24 hrs) 
| 3000 - 5000 (/mm 3 ) | 150 - 135 (mEq/1) | X (13 ^ 17)5 (...)] = 
[1 (mod 7) | 5.5 (/24 hrs) | - 2000 (/mm 3 ) | 15 (mEq/1) 

| X( 13 ^i7) 5 (...)]. 
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